首页> 外文OA文献 >Generating Holistic 3D Scene Abstractions for Text-based Image Retrieval
【2h】

Generating Holistic 3D Scene Abstractions for Text-based Image Retrieval

机译:生成基于文本的图像检索的整体三维场景抽象

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Spatial relationships between objects provide important information fortext-based image retrieval. As users are more likely to describe a scene from areal world perspective, using 3D spatial relationships rather than 2Drelationships that assume a particular viewing direction, one of the mainchallenges is to infer the 3D structure that bridges images with users' textdescriptions. However, direct inference of 3D structure from images requireslearning from large scale annotated data. Since interactions between objectscan be reduced to a limited set of atomic spatial relations in 3D, we study thepossibility of inferring 3D structure from a text description rather than animage, applying physical relation models to synthesize holistic 3D abstractobject layouts satisfying the spatial constraints present in a textualdescription. We present a generic framework for retrieving images from atextual description of a scene by matching images with these generated abstractobject layouts. Images are ranked by matching object detection outputs(bounding boxes) to 2D layout candidates (also represented by bounding boxes)which are obtained by projecting the 3D scenes with sampled camera directions.We validate our approach using public indoor scene datasets and show that ourmethod outperforms baselines built upon object occurrence histograms andlearned 2D pairwise relations.
机译:对象之间的空间关系为基于文本的图像检索提供了重要信息。随着用户使用3D空间关系而不是采用假定特定观看方向的2D关系来从区域世界的角度描述场景的过程中,主要挑战之一就是推断将图像与用户文字描述联系起来的3D结构。然而,从图像直接推断3D结构需要从大规模注释数据中学习。由于对象之间的相互作用可以简化为3D中有限的原子空间关系集,因此我们研究了从文本描述而不是图像推断3D结构的可能性,应用物理关系模型来合成满足文本描述中存在的空间约束的整体3D抽象对象布局。我们提出了一个通用框架,可通过将图像与这些生成的抽象对象布局进行匹配来从场景的文本描述中检索图像。通过将对象检测输出(边界框)与2D布局候选对象(也由边界框表示)匹配来对图像进行排名,这些候选对象是通过使用采样的摄像机方向投影3D场景而获得的。基于对象发生直方图和学习的二维成对关系建立基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号